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Abstract 

Markov width of a graph is a graph invariant defined as the max- 
imum degree of a Markov basis element for the corresponding graph 
model for binary contingency tables. We show that a graph has 
Markov width at most four if and only if it contains no as a minor, 
answering a question of Develin and Sullivant. We also present a lower 
bound of order Q(n 2 ~ £ ) on the Markov width of K n . 



1 Introduction 

A contingency table is a d\ x d,2 x • • ■ x d n array of non-negative integers. 
Contingency tables are used to record and analyze the relationship between 
n discrete random variables X\, X2, ■ ■ ■ , X n , where Xk takes values in the 
set [dk] = {1, 2, . . . , dk} for k — 1, 2, . . . , n. In hierarchial models a simplicial 
complex A on {1, . . . , n} encodes interactions between the variables. One can 
estimate how well empirical data in the given table fits a hierarchial model 
by comparing statistics of this table with statistics of a random non-negative 
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integral table with the same set of marginals. In [Tj an algebraic approach 
for generating such a random table has been presented. This approach can 
be informally summarized as follows. A finite set of moves, such that any 
two tables with same set of marginals are connected by a sequence of such 
moves, is computed. Such a set of moves is called a Markov basis. Given 
a Markov basis, a random table is generated by performing a random walk 
using the moves in the basis. 

Thus, description of Markov bases of a given model is of interest and has 
attracted attention of researchers in recent years. For a more detailed intro- 
duction see [21 [6] . In this paper, following [2], we concentrate our attention on 
binary graph models, i.e. hierarchial models of 2 x • • • x 2-contingency tables, 
where the simplicial complex A, which encodes the variable interactions, is 
a graph. 

Let us now give a formal definition of a Markov basis of a binary graph 
model. For a finite set X = {x\,X2, . . . , x n } let Z(X) ~ Z 2 ™ denote the 
integral lattice with an orthonormal basis ea,(xi)a(x2)...a(x„) indexed by binary 
labeling a : X — > {0, 1} of X. For a subset Y C X there exists a natural 
projection map tix^y '■ ZiX) — > Z(Y) defined as the linear extension of the 
mapping e a — > e a \ Y . In most cases the original set X will be understood from 
context and we will frequently abbreviate tix^y to ny- Similarly, we will 
abbreviate 7i~{ a ,b} to ir ab . 

Given a graph G let 

vr G : Z{V{G)) -> (g) Z{V{e)) 

e&E{G) 

be a linear mapping obtained as the product of the maps 7Ty, ij G E. The 
mappings it^ correspond to 2-way marginals of a 2 x ■ ■ • x 2 table. A finite 
subset B C Ker(7r G ) is a Markov basis if for every non-negative integral 
vectors t>i,t>2 G Z(V(G)) with ira(vx) = irafa), there exist u±, . . . ,U£ G ±B 
satisfying the following: 

£ £' 

Vi + Uk = v 2 and v\ + Uk is non-negative for all 1 < £' < £. 

k=l k=l 

The Markov width fi(G) of G is the smallest integer k such that there exists 
a Markov base B of G with ||f||i < 2k for every v G B where \\v ||i is the 
i^-norm of v. The motivation for considering Markov width as the measure 
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of complexity of the set of Markov bases of a binary graph model comes 
from the fact that a binary model can be alternatively defined as a binomial 
ideal. In this setting Markov bases correspond to generating sets of the ideal, 
and Markov width equals to the degree of the largest minimal generator of 
the ideal. We omit the precise definitions, as we do not make use of this 
reinterpretation in our arguments, and refer the reader to [21 E] for details. 
It is known [31 [EE] that n{G) = 2 if and only G is a forest, and that 
> 4, otherwise. In this paper we characterize graphs with fi(G) = 4, 
answering a question of Develin and Sullivant [2]. We show that n{G) < 4 
if and only if G does not contain a subdivision of the complete graph K4 
as a subgraph. Additionally, we investigate the Markov width of complete 
graphs. Develin and Sullivant [2] showed that the Markov width n{K n ) of 
the complete graph on n vertices is lower bounded by 2n — 2. We strengthen 
this lower bound, showing that fi(K n ) > Q(n 2 ~ e ) for every e > 0. 

2 Markov width of K \- free graphs 

We start by describing a standard construction used in inductive arguments 
on fQ-minor free graphs. Series-parallel graphs are graphs with two distin- 
guished vertices called poles, obtained from elementary graphs by a recursive 
construction. The simplest series-parallel graph is an edge uv with the two 
poles being its end-vertices. If G\ and G 2 are series-parallel graphs with poles 
u\ and Vi, and u 2 and v 2 , respectively, then the graph G obtained by identi- 
fying the vertex v± with u 2 is also a series-parallel graph and its two poles are 
the vertices u\ and v%. The graph G obtained in this way is called the serial 
join of G\ and G 2 . The parallel join of G\ and G 2 is the graph obtained by 
identifying the vertex U\ with u 2 and the vertex V\ with v 2 ; the poles are the 
identified vertices. The series-parallel graphs are precisely those that can be 
obtained from edges by a series of serial and parallel joins. The sequence of 
such joins leading to a construction of a graph G is called a series-parallel 
decomposition of G. The series-parallel decomposition of G is not unique. 
In our considerations, we will need the following (folklore) lemma. 

Lemma 1. Every 2 - connected K^-minor free graph is a series-parallel graph. 
IfG is a 2 - connected K^-minor free graph that is not a cycle, then there exists 
a series-parallel decomposition of G such that G is obtained by a parallel join 
of at least three series-parallel graphs. 
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Observe that if the last operation in a series-parallel decomposition of G is a 
parallel join, then G is 2-connected (the converse is also true). In particular, 
we can apply Lemma [1] to such a graph G. 

Finally, following standard graph theory terminology, we define a {u, v}- 
bridge B of G to be a connected subgraph of G such that either E(B) = {uv} 
or for some component C of G\ {u, v} the set E(B) consists of all edges of G 
with at least one end in V(C). Note that a series parallel-graph G with poles 
u and v can be obtained from the set of its {u, t>}-bridges by a sequence of 
parallel joins. 

In this section we characterize graphs with Markov width equal to four. 
The cycles have this property, as shown in [2J. 

Lemma 2 Q2J). If G is a cycle then fi(G) = 4. 

Throughout the proof of the main theorem of this section, we will use the 
following straightforward observations repeatedly, and so we state them as 
lemmas. For convenience, let Z($) = Z and ttx-+$(z) = \ \z\\i for z G Z{X). 

Lemma 3. Let X\ and X 2 be finite sets, let X = X x UX 2 and Y = X\ flX 2 . 
Let z G Z(X) and 1 G Z{X\) such that 7Tx-^y(z) — ^Xi-*y(z)- Then there 
exists z' G Z(X) such that irx-+Xi {%') — z, ttx^x 2 (z') = tcx^x 2 ( z ) and 
\\z - z'\\i = ||7rx-Xi(<z) -^Hi- 
Lemma 4. Let X\ and X 2 be finite sets and let X = X\ U X 2 . Let z, z' G 
Z(X) be such that irx-*Xi(z) = nx-*x i (z') f or i = 1,2. Then there exist 
vectors z = Zq, Zx, z 2 , ■ ■ ■ , Zi — z' G Z(X) such that \\zk — Zfc-i||i = 4 and 
ftx->Xi( z k-i) — ffx-tXiizk) f or i — 1)2 and k — 1, . . .,£. Moreover, if z and 
z' are non-negative, then z Q , Zi, . . . , ze can be chosen to be non-negative. 

Lemma 5. Let Xi and X 2 be finite sets, let X = X\ UX 2 and Y = X\ P\X 2 . 
Let Z\, z[ G Z(X\) andz 2 ,z' 2 G Z(X 2 ) such thatizx 1 -*Y{zx) = ^x 2 ^y(z 2 ) and 

7rx 1 ~^Y(z' l ) = KX 2 -+Y(Z 2 ). If 

W^X^Y^!) ~ Kx-l-*y{z't)\\\ = \\Z\-z[\\l= \ W ~ 41 |l > 

then there exist z,z' G Z(X) such that ttx^xX z ) = z i> Kx-+Xi{z') — z\ and 
\\z — z'\\x — | pi — 2i||i. 

We first show that the Markov width of every series-parallel graph is at 
most four. In fact we prove a slightly stronger and more technical result. 
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Theorem 6. Let G be a series-parallel graph with a vertex set X and poles 
u andv. If z,z' G ZiX) are two non-negative vectors with ttg(z) — ^g( z> ); 
then there exist non-negative vectors Zq, . . . , Z\ G Z(X) such that 

1. z — z, Z\ — z , 

2. n G (z) = n G (z k ) for every k = 0, . . . , A, 

3. | \zk — 2fc-i| |i < 8 for every k = 1, . . . , \, and 

4. if-K U v{ z k-i) ^ n uv (z k ), then \\z k - Zk-i\\i = 4. 

Proof. The proof proceeds by induction on the order of G, i.e., \X\. If |X| = 
2, then the graph G is an edge and thus z = z' . The claim readily follows. 

Assume now that G is a graph of order at least three obtained from 
graphs G± and G2 by a serial or a parallel join. Let X{ be the vertex set of 
Gi, i = 1,2. We distinguish four cases: 

• G is obtained by a serial join of G\ and G2. 

Let w be the vertex shared by G\ and G2] by symmetry, we can assume 
that the pole u is contained in G\ and v in G<i- We apply induction to 
Gi with z % = 7iXi( z ) and z' % = 70^(2/), for i = 1,2. Let, for i = 1,2, 
z^, . . . , z\ be the resulting sequences of vectors (note that by padding 
the sequences with 71^(2') at the end, we can assume that the sequences 
have the same length). 

We construct the required sequence Zo, . . . , Z\ as follows. We start 
by constructing a sequence of vectors We have now constructed a se- 
quence of vectors z ,. . . ,Z£ such that z = z, TCx 1 ( z k) = n Xl (zl) and 
7l x 2 ( z k) = 7! 'x 2 ( z ) for k = 1,2, . . . ,£, and Properties 2,3 and 4 from the 
lemma statement are satisfied. Let k G [1, l\ and assume the vectors 
zo, . . . , Zk-i have been defined. If ir U w( z l-i) = ^uw{ z l), apply Lemmai 
with X[ — X\, X' 2 — X 2 U {u}, z = Zk-i and z = z\ and set z k to be 
the resulting vector z' . Clearly, \\z k — z^-illi = \\ z \ ~ ^ xA z k-i)\\i = 
H4 - 4-illi - 8aD.dir uvw (zk-i) = 71-^(2*). Since tt Gi = 7TGi(4)' 
it follows iTG( z k-i) = TTG^fc)- In particular, Properties 2, 3 and 4 are 
satisfied in this step. 

If 7r UM ,(4_i) ^ n uw (zl), then \\z\ - 4-1 Hi = 4 b Y Property 4. By 
Lemma [3] applied with 2 = z k -\ and 2 = z\, there exists 2^ = z' such 
that 7^(2%) = z\, n X2 (z k ) = nx 2 ( z l-i) = Kx 2 ( z ) and H^-^-illi = 4. 
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Again, n G A z l-i) = n Gl (z\) implies that 7r G (z fe _i) = ir G (z k ). Since 
\\ z k ~ -2fc-i||i — 4, Properties 2, 3 and 4 are also satisfied in this step. 

We have now constructed a sequence Zq, . . . , zg such that z = z, 
Ttx^zp) = 7Tx 1 {z') and TCx 2 ( z e) = irx 2 (z). An analogous argument yields 
the existence of a sequence ze,...,z 2 £ such that nxiizu) = ^Xx{z'), 
Kx 2 { z 2i) = irx 2 ( z ') satisfying Properties 2, 3 and 4. By Lemma IU the 
sequence z , . . . ,z 2 e can be completed to a sequence z , . . . , z x such that 
Z\ = z' and \\z k — z k-i\ |i = 4 for k = £ + 1, . . . , A. Clearly, the resulting 
sequence has Property 2. Since the £i-norm of the vectors Zk — z k-i, 
> 2£, is four, the sequence z 2 i, ■ ■ ■ , Z\ has also Properties 3 and 4. 

• G is obtained by a parallel join of Gi and G 2 , uv is an edge of 
G, and G has at least three {u, v }-bridges. 

By permuting the order of the parallel joins in the series-parallel de- 
composition of G, we can assume that neither G\ nor G 2 is an edge. 
By symmetry, G\ contains the edge uv. Let G[ be G\ and G' 2 be G 2 
with the edge uv added. 

We apply induction to G[, z % = 7TXi( z ) an d z ' % = ir Xi (z'), i = 1,2. Let 
Zq, . . . , z\ be the resulting sequence of vectors (note that by padding 
the sequences with nxX 2 ') at the end, we can assume the sequences to 
have the same length). By Lemma [3], there exist a sequence of vectors 
Zq,..., z 2 £ such that n Xl (zi) = z\ and nx 2 (zi) = Zq for i = 0, . . . , £, 
7i Xl (zi) = z\ and 7Tx 2 i z i) — z f_ t for % — £+ 1, . . . , 21 and \ \zi — 1 1 < 8 
for i — 1, . . . ,2£. Clearly, this sequence has Properties 2 and 3. 

Lemma @] yields that there is a sequence z 2 p, . . . , z x such that z\ = z' , 
KXiizzt) = ■ ■ ■ = rrxX z ') and | \z k - z k -i\\i < 4 for k = 21 + 1, . . . , A. 
The presence of the edge uv in G implies that n uv (zo) = ■ ■ ■ = tt uv (z\) 
which yields Property 4. 

• G is obtained by a parallel join of G\ and G 2 and uv is not an 
edge of G. 

If n uv (z) = ir uv (z'), add the edge uv to G and proceed as in the previous 
case. Hence, 7r uv (z) ^ n uv (z'). Observe that there exists a (unique) 
sequence of vectors w , . . . ,w m G Z({u,t>}) such that w = tt uv (z), 
w m = nuv(z'), \\w r — uv-i||i = 4 for r = l,...,m and ^^(z') — 
7T w (^)||i = 4m. 

We now apply induction for Gi, z 1 = 7tXi(z) and z n = ttxXz'), i — 1,2. 
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Let Zq, . . . , z\ be the resulting sequence of vectors. By Property 4, 
there exist indices k l r such that ir uv (z l ki _ x ) = w r -i and it uv (zli) = w r for 
r = 1, ... ,m and % = 1, 2. Since it is possible to prolong the sequence 
by repeating some of the vectors several times, we can assume that 
kl — kf] let k r be their common value in the rest. 

By Lemma there exist vectors z\,...,z m and z' , . . . , z' m _ x such that 
nxi( z r) = 4r for i = 1,2 and r = l,...,m, nxAK-i) = 4 r -i for 
i = 1,2 and r = 1, . . . , m, and \ \z r — ^-llli = 4 for every r = 1, . . . , m. 
For convenience, set z Q = z and z' m — z'. 

Let G' be the graph obtained from G by adding the edge uv. The 
choice of the indices k r implies that iTG>(z r ) = iiG>(z' r ) for r = 0, . . . , m. 
In particular, we can apply the argument used in the previous case for 
G' with z and z' , G' with z\ and z[, . . . , G 1 with 2; r and z' r and con- 
catenate the obtained sequences of vectors. Clearly, the final sequence 
has Properties 1, 2 and 3. Because of the presence of the edge uv in 
G' , the mapping tc uv is constant inside each of the r sequences in the 
concatenation. Since \ \z r — -4-iHi = 4 for r = 1, ...,m, the resulting 
sequence also has Property 4. 

• G is obtained by a parallel join of G± and G2, uv is an edge of 
G and G has only two {u, v }-bridges. 

Clearly, one of the {u, v }-bridges is the edge uv. If G is a cycle, 
then there exist vectors z Q , . . . , z\ satisfying Properties 1, 2 and 3 by 
Lemma [2J Since uv is an edge, tc uv (z ) = ■ ■ ■ = 7i uv (ze) which implies 
that the sequence also satisfies Property 4. 

On the other hand, if G is not a cycle, then Lemma [1] implies that G 
has another series parallel decomposition, say with poles u' and v ' , such 
that G has at least three {u', v '}-bridges. Based on whether G contains 
the edge u'v' , we apply the arguments presented in the second case 
or the third case to obtain a sequence of vectors z , . . . , z\ satisfying 
Properties 1, 2, 3 and 4 with respect to u' and v'. Since uv is an edge of 
G and thus tc uv (z ) = ■ ■ ■ = Tr uv (z\), Property 4 also holds with respect 
to u and v. 

□ 

It is now easy to derive the main result of this section: 
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Theorem 7. The Markov width fi{G) of a graph G is at most four if and 
only if G does not contain as a minor. In particular, fi(G) = 4 if and 
only if G is not a forest and has no K4 as a minor. 

Proof. It is shown in [2] that ^(K^) = 6, and n(H) < n{H') if H is obtained 
from H' by a sequence of edge contractions and vertex deletions. Therrefore, 
the Markov width n(G) of every graph G containing K± as a minor is at least 
six. Moreover, if G is not a forest, then > 4 by Lemma [21 

On the other hand, it is easy to see that, the Markov width of a graph 
is the maximum of the Markov widths of its blocks (maximal 2-connected 
subgraphs). As every 2-connected i^ 4 -minor-free graph is series-parallel (in 
case of a forest, the blocks are single edges), the Markov width of every graph 
with no K4 minor is at most four by Theorem [61 □ 

3 Lower bound for complete graphs 

We will derive a lower bound on the Markov width of a complete graph 
from bounds on maximum density of clean triangulations of surfaces. A 
triangulation T of a surface is clean if every triangle of T is a face triangle. A 
triangulation T is 2- face- colorable if its faces can be colored with two colors 
in such a way that every two adjacent faces receive distinct colors, i.e., the 
dual graph of T is bipartite. For other, more standard, definitions related to 
triangulations of surfaces, we refer the reader to the monograph [TJ. 

Lemma 8. If there exists a clean 2- face- colorable triangulation T with n 
vertices and m edges, then the Markov width fi(K n ) of the complete graph 
K n is at least m/3. 

Proof. Assume that the faces of T are colored with red and blue and let 
G = K n be the complete graph on the same vertex set as T, which we 
identify with the set [n]. Remember that has a basis e a , where a 

ranges over all functions from [n] into {0, 1}, which could be considered as 
indicator functions of subsets of [n]. Let z r G £([n]) be defined as the sum of 
vectors e x {F) over all red faces F of T. In particular, | \z r \ |i = m/3. The vector 
z b G <£([n]) is defined analogously with respect to blue faces. It is not hard 
to see that ttg( zT ) — ^(-s 6 )- Let n G (z r ) = {z G <£([n]) | itg( z ) — n G( z r)} be 
the fiber of containing z r . We will show that Hciz 7 ") = {z r , z b } which will 
imply the statement of the lemma by the definition of the Markov width. 
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Consider z G Il G (z r ) and let Ai,...,A m / 3 be (not necessarily distinct) 
subsets of [n] such that 

m/3 

Z X(A) = ^2^x(A t )- 
i=l 

By considering irij(z r ) = ^ij(z) for i,j G [n] we see that if ij G E(T) then 
{i, j} belong to exactly one member of {A\, . . . , A m / 3 ) and, if ij (jL E(T) then 
belongs to no member of this family. It follows that no two sets A\ 
share more than one elements, and that every A{ forms a vertex set of a 
complete subgraph of T . Since T is a 2-face-colorable clean triangulation, T 
contains no subgraph isomorphic to K± and consequently \A^\ < 3. On the 
other hand, the sum of the sizes of A\, . . . , A m / 3 is independent on the choice 
of z G Ug(z t ) and we have 

m/3 

^2 |A| = m. 

i=i 

This implies that \Ai \ = \A 2 \ = ... = |^4 m /3| = 3. Hence, each Ai corresponds 
to a face of T. Therefore A ly . . . , A m / 3 is a collection of the vertex sets 
pairwise non-adjacent faces of T, such that every edge of T is contained in 
some face of the collection. There are only two such collections, namely 
the vertex sets of the red faces and of the blue faces of T. In particular, 
z G {z r ,z b }, as claimed. □ 

We now apply Lemma [8] to obtain a lower bound on fi(K n ). A simple 
triangulation satisfying the assumptions of Lemma [8] is a double- wheel drawn 
on the sphere: It has iV + 2 vertices and 3iV edges. Hence, Lemma [H] gives us 
that the Markov width of K n is at least n — 2. To obtain a superlinear bound, 
we need denser clean triangulations. Such triangulations of essentially opti- 
mal density were constructed by Seress and Szabo in [8]. For every e > 
and for every sufficiently large integer n, they construct a clean triangula- 
tion T n>e of some surface S with n vertices and n 2 ~ e edges. The constructed 
triangulations are 3- vertex-colorable. If the corresponding surface £ is ori- 
entable then the triangulation is 2-face-colorable, as the clockwise orders of 
the vertex colors around adjacent faces are different. If £ is non-orient able, 
it is possible to obtain a clean triangulation with 2n vertices and 2n 2 ~ £ edges 
which is 3-vertex-colorable by considering an orientable 2-cover of S and a 
corresponding clean triangulation. 

We can now infer from Lemma [H] the following: 
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Theorem 9. For every e > 0, there exists n such that fi(K n ) > n 2 £ for all 
n > n . 

4 Final Remarks 

1. Our definition of binary graph models differs from the standard one. 
We do not consider 1-way marginals corresponding to the vertices of 
the graph. This is a very minor distinction, as for graphs of minimum 
degree one the fibers of ttq remain unchanged, and a binary graph 
model of a disconnected graph can be considered as a toric fiber product 
(see [10]) of the models corresponding to its components. 

2. In [19] Sturmfels and Sullivant consider cut ideals of graphs. Those are 
binomial ideals which are closely related to binary graph models as 
pointed out in [9]. In particular, the Markov width fi(G) of a graph G 
equals to the maximum degree of a binomial appearing in a minimal 
generating set of the cut ideal Iq, where G is a graph obtained from G 
by adding a universal vertex, that is a new vertex joined by an edge to 
every vertex of G. 

Sturmfels and Sullivant conjecture that I G can be generated in degree 
four if and only if G has no K 5 minor. Our result can be interpreted 
as a partial result towards this conjecture, verifying it for all graphs 
containing a universal vertex. Also let us mention that Engstrom [I] 
recently proved another conjecture from [9] showing that a cut ideal Iq 
can be generated in degree two if and only if G has no K4 minor. 
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